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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Zsebo, Krisztina M. 

Bosselman, Robert A. 
Suggs, Sidney V. 
Martin, Francis H. 

(ii) TITLE OF INVENTION: Stem Cell Factor: Composition Claims 

(iii) NUMBER OF SEQUENCES: 104 

(iv) CORRESPONDENCE ADDRESS: 

\A/ ADDRESSEE: Mars nail, u ' TooiG , Gsrstsin , Murray 6* Borun 

(B) STREET: 6300 Sears Tower, 233 South Wacker Drive 

(C) CITY: Chicago 
CD) STATE: Illinois 

(E) COUNTRY : United States of America 

(F) ZIP: 60606-6402 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC - DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/005,893 

(B) FILING DATE : 1 2 - JAN - 1 9 9 8 

(C) CLASSIFICATION: 



(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/449,653 

(B) FILING DATE: 24-MAY-1995 

(C) CLASSIFICATION : 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/982,255 

(B) FILING DATE: 25-NOV-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/589,701 

(B) FILING DATE: 01-OCT-1990 



(vii) 



PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/573,616 

(B) FILING DATE: 24-AUG-1990 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/537,198 

(B) FILING DATE: ll-JUN-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/422,383 

(B) FILING DATE: 16-OCT-1989 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Clough, David W. 

(B) REGISTRATION NUMBER: 36,107 

(C) REFERENCE /DOCKET NUMBER: 01017/35136 



<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : 312/474-6300 

(B) TELEFAX: 312/474-0448 

(C) TELEX: 25-3856 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 



Glu Glu lie Cys 
1 

Lys Leu Val Ala 
20 

Val Ala Gly Met 
35 

Val Thr His Leu 
50 

Asn lie Ser Glu 

65 

Lys lie Val Asp 



Asn Val Lys Glu 
100 



Arg Asn Pro Val 
5 

Asn Leu Pro Asn 



Asp Val Leu Pro 
40 

Ser Val Ser Leu 
55 

Gly Leu Ser Asn 
70 

Asp Leu Val Ala 
85 

Ser Leu Lys Lys 



Thr Asp Asn Val 
10 

Asp Tyr Met lie 
25 

Ser His Cys Trp 



Thr Thr Leu Leu 
60 

Tyr Ser lie lie 

75 

Cys Met Glu Glu 
90 

Pro Glu Thr Arg 
105 



Lys Asp lie Thr 
15 

Thr Leu Asn Tyr 
30 

Leu Arg Asp Met 
45 

Asp Lys Phe Ser 



Asp Lys Leu Gly 
80 

Asn Ala Pro Lys 
95 

Asn Phe Thr Pro 
110 



Glu Glu Phe Phe Ser lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp 
115 120 125 
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Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu 
130 135 140 

Gly Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu 
145 150 155 160 

Pro Pro Val Ala Ala 
165 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME / KEY : modif ied_base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= Inosine 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
ACRTTYTTNG GNGCRTTYTC YTCCAT 26 
(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE : 

(A) NAME /KEY : modi f ied_base 

(B) LOCATION: 12 and 15 

(D) OTHER INFORMATION: /mod_base= Inosine 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
AARAAYTCYT CNGGNGTRAA RTT 23 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



# 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GTYTCNGGYT TYTT 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
ATGGARGARA AYGCCCCCAA RAAYGT 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
CCNAAYGAYT AYATGWTMAC 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
GGNGGNARCA TRAANGGYTT 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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( l i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
AC C AKAARAT CTTYAAANCG ATC 
(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

( i i ) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GTATTTTCAA TAGATCCATT GA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( i i ) MOLE CULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 0 : 
CCAACTATGT CGCC 

(2) INFORMATION FOR SEQ ID NO : 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GTAGTCAAGC TGACTGATAA G 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



# 
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(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TAACCAACAA TGACTAGGCA A 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TTCCAGAGTC AGTGTC 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
GCGAAGCTTG CCTTTCCTTA TGAAGAAGA 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 5 
GCGCCGCGGT TACGGTGGTA ACATGAAGGG CTTTGTGA 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY -. linear 
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(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GATAAATGCA AGTGATAATC C 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l7: 
GCGGTCGACC CGCGGAACTT TAAGTCCATG CAACAC 
(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CACCCGCGGT TATGCAACAG GGGGTAACAT AAATGG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19 
CACCCGCGGT TAGGCTGCAA CAGGGGGTAA CATAAA 
(2) INFORMATION FOR SEQ ID NO : 2 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xx) SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 : 
CTTAATGTTG AAGAAACC 

(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GATGGTAGTA CAATTGTCAG AC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GTCTGACAAT TGT ACT AC C A TC 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
CAATTTAGTG ACGTCTTTTA CA 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 4 : 
TTAGATGAGT TTTCTTTCAC GCAC 
(2) INFORMATION FOR SEQ ID NO : 2 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
AAATCATTCA AGAGCCCAGA ACCC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
AA CATC CATC CCGGGGAC 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:: 
CTGGCAATAT TTTAAGTCTC AAGAAGACC 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GCGCCGCGGC TCCTATAGGT GCTAATTGG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 
CCTCACCACT GTTTGTGCTG GATCGCA 
(2) INFORMATION FOR SEQ ID NO : 3 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 
GGTGTCTAGA CTTGTGTCTT CTTCATAAGG A 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 
CCCCCCCHGG 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TTTTTTTTTT TTTTTTTTGG 
(2) INFORMATION FOR SEQ ID NO : 3 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 3 : 
TTTTTTTTTT TTTTTTTTAG 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
TTTTTTTTTT TTTTTTTTCG 
(2) I NFORMAT I ON FOR SEQ ID NO : 3 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 
TTCGGCCGAT CAGGCCCCCC CCCC 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ll) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 6 : 
TTCGGCCGGA TAGGCCTTTT TTTTTTTTTT 30 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

24 

GGCCGGATAG GCCTCACNNN NNNT 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

17 

GGCCGGATAG GCCTCAC 

(2) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY 

(B) LOCATION 
3692 . 

(ix) FEATURE: 

(A) NAME /KEY 

(B) LOCATION 

3692 . 



CDS 

jo 
3774) 



in(660..773, 1184 1246 , 2053. .2223, 2837. 



, 2993 , 



: mat peptide 

: jorn(720. .773, 1184 1246 , 2053. .2223, 2837. .2993, 
.3774) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 9 : 
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AAAGTATCTT TCTATTGGCG AAGGACATGT TTTCCCATAA GTGGTAAACA AACTGTCTGC 

ACATAATAAT TATCTTGCTG CCGTAAAGAT TAGGTTAAAT TCTGCCTTCG AT C T AAAAAC 

ACACCCTTCT GTCAATCCGA GGAGCAGTGT GCTAGTCTAG AGGTCTAAAT GAAGGCTCCT 

TTCACGGTTG TATTTCTGCT CCCCAAATTG TCCACATTTA AAAGGAGAGT GCTTCTTTTC 

AGCCTTAGGC TCTGAATTTC ATGCATTCCT CCATTTTCCG AGGTCCCCCC CAAGTGATAA 

TTCTGTTACA CGTTGCTACA AGTTCATCCC TAATTGCCGT CAAGAAACTG ACTGTAGAAG 

GCTTACCACA GACGTTGTAA CCGACAGTAA AGCCATTGAA AGAGTAATTC AAACAGGATG 

GAAGCCAGGA GTATTTTGTG GCTGTTGCTC TTTTTCTTTT CAGTTTGGTG AGAGCAGCTT 

GAATGCTTAA CATTTAAGCC ATCAGCTTAA AACAAAACAA AACAAAACAA AAAAAAACCC 

CGCTCTGGCA TATTTGCACT TAACACATAC GGTATAAGGT GTTACTGGTT TGCATAGTTC 

TGGATTTTTT TTTTTTAAAA ACTGATGGAC AC C AAGAAAT GTTTCTGTTC TTTGTTTAG 

ACT TGG ATT ATC ACT TGC ATT TAT CTT CAA CTG CTC CTA TTT AAT CCT 
Thr Trp He He Thr Cys He Tyr Leu Gin Leu Leu Leu Phe Asn Pro 
-20 -15 " 5 

CTC GTC AAA ACT CAG GAG ATC TGC AGG AAT CCT GTG ACT GAT AAT GTA 
Leu Val Lys Thr Gin Glu He Cys Arg Asn Pro Val Thr Asp Asn Val 
1 5 10 

AAA GAC ATT ACA AAA CTG GTAAGTAAAG AATGATTTTG GCATCTATAA 
Lys Asp He Thr Lys Leu 
15 

GTCTTCCCTG TGCTTGCTGA CCACATAGGT TCAGGGCACT C C C GAC AGG A GTTCCCAGCT 

TTCTAAGATA AGGAATCACT GTACGAGTCT GAAGTGCTTC TTCTGGGCAA ATGGGAGATG 

CTTAGGTCAT GGAGGGTTTA TC TGTATAAC TGGCCCTTTG CACACCAACA AAGTGACTGA 

CTGGCTTTTG CCTGTTACCT ACTGTCTCCA GTCCTGGGCA TGGTATATAC TTAGGCACCC 

AAGATTGGAT TTACAACTCA AGCATTATAT ATTGGACAAC ACGGGGTATG AG AT AT T AAT 

GATATGTCAG GTTGGATGGA TGAGTTTTCT CAAGAAATTC TCTTGTATTT ACTCACGTTT 

TCATTTCTTG GTCTCTGTAG GTG GCG AAT CTT CCA AAT GAC TAT ATG ATA 

Val Ala Asn Leu Pro Asn Asp Tyr Met He 
20 25 

ACC CTC AAC TAT GTC GCC GGG ATG GAT GTT TTG GTATGTAGTC CACACACTTC 
Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 

30 3 5 
TGAGTTGCCT TTTAGTAGCT AATGGGTGAC CTGTGCTTAT TCACATTGAA GACATTATTT 



60 
120 
180 
240 
300 
360 
420 
480 
540 

600 

659 

707 

755 

803 

863 
923 
983 
1043 
1103 
1163 
1213 

1266 

1326 
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GCTCTTTGTC GTTTTTAGAT GTTGAC CTAT AATTTTTCCT TCAAGCTGCT GCTAAGATTA 

TCAGTGAGCA TTTCAGTATG TGTTTTAAGC CTACTCATTA AAAGGAAATG GCTCATCTTA 

GACGTAGCAA CCGATGTTAA TTTTTCCCCA GGCATCTCTC AGAGGGACTT GAATGTTAAA 

ATCATGTTAA ATTTCCTCCT TGGCTATGTT ATTTCTCATG GCTATGTTAT TCCTATTCGT 

ATTTCATTTA AAGGGACGGA ATATTTATTG TATTTCTGAA CTTTTTCAGG CATGCATCCG 

GGTCTTTGAA TAAAACACTA AGACTCCTTC TAGTAATGTT TGTAATCCTG TCTGTATCGA 

ATGT CTTTG A AAACGCAGTG ACTAAGCCAT AAATAATCTT CCACAGAACG TCCAGTGGTT 

CATGAACTTT GTATGTGGGG GTGGGGCAAG AATTGTCTCA CTATTGGTCA AGGAAGAGAA 

GGTAAGGTAT GCAAGGGTGG TTTAATCTTC TTCCGTGGAA GGACAAAATC ATCTATCATT 

TCCTCTGATC TCTATGCATT TGTTTGTTTT GAACTGAATC TGACTTGAGC AAGAGTTGGC 

GTCCTGTGTT CTGAGGAAAC TCTTTGTCCT GCAGTCAGTG ACTAAAAGTG C TG AG AG AT C 

TGAAGAGCAC TCTGAATCTG C CAT ATT TT T AATAGATGCT TTGTCTTCTC TTTGAATTTC 

TTCCAG CCT ACT CAT TGT TGG TTA CGA GAT ATG GTA ACA CAC TTA TCA 
Pro Ser His Cys Trp Leu Arg Asp Met Val Thr His Leu Ser 
40 45 

GTC AGC TTG ACT ACT CTT CTG GAG AAG TTT TCA AAT ATT TCT GAA GGC 
Val Ser Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly 
55 60 65 

TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT GGG AAA ATA GTG GAT GAC 
L eu Ser Asn Tyr Ser He He Asp Lys Leu Gly Lys He Val Asp Asp 

CTC GTG GCA TGT ATG GAA GAA AAT GCA CCT AAG GTAACTTGGT ATTCATCAGA 
Leu Val Ala Cys Met Glu Glu Asn Ala Pro Lys 
90 95 

ATTATTTTTC TTATACTGAG CTCATGATGA GCAATTCACA ACCACTTGTA ATTCCAGCTC 

CAGAGGACAT TATCCCCTCT TTGGATGCCA TAGGAATCTG CTCTCAAATA TG TAG AT AC C 

ACCTCTGCCA CCTCAGCACA TACATACACA TAATTAAAAA ATAGAAACAT TAAAGGAGTT 

C C AAT C AAT C CTTATT CTTT TCTGTATTCA GTATGCCCAG ATGTAAATTC TAGGAATATG 

TTT T AAAGG C TAATTCTTAT TTTGTAATAA GCAGCTTTAA AATTCTTAAT TGTTTTTTCG 

GGTCACTTTA TTGTCCTATT GCCACGACAT TGTCCTGTCC CATTGTCTGT TATTCCTTCT 

GTTTTGTTTA TTGTTCCCTA GTTACTTTGA TCATGAGATT GACCTGTTAC CCGTTGTTAT 

TCTCTGTAGC CATTTTGAGT TGTGTCTATT AGAACAGCTG TTAAATTACT TGAATCATTG 



1386 

1446 

15 06 

1566 

1626 

1686 

1746 

1806 
1866 
1926 
1986 
2046 
2094 

2142 

2190 

2243 

2303 

2363 

2423 

2483 

2543 

2603 

2663 

2723 
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AGGACATAGT CAATAATCTA TTATGCTGAT CCAGTCAAGT CTATGAGTTA TTTGAAAACT 

AGAATCTTTG TTAATTATTT GTTTGCTTGT TTGTTTGTTT ATTATTTGTC TAG AAT 

Asn 



GTA AAA GAA TCA CTG AAG AAG CCA GAA ACT AGA AAC TTT ACT CCT GAA 
Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu 
100 105 HO 

GAA TTC TTT AGT ATT TTC AAT AGA TCC ATT GAT GCC TTC AAG GAC TTC 
Glu Phe Phe Ser He Phe Asn Arg Ser He Asp Ala Phe Lys Asp Phe 
115 120 125 

a -rr, r,TR CCA TCT GAC ACT AGT GAT TGT GTG CTC TCT TCA ACA TTA GGT 
Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu Gly 
130 135 "0 145 

CCT GAG AAA G GTAAGGCTTT TAAGCATTTC TTGTTTAAAT GTACATAGAA 
Pro Glu Lys 



2783 
28 3 9 

2887 

2935 

2983 

3033 



AGCCTGAACT TCTGTAAGCC TCTACTGCTG AATCAACTAA ATGTGTTGCT GTAGAAAGAA 

CGTGTGGGTT TTTCTGATAA AAACAAAAAG CAAATATCAA TGACTACCAA TGATTATTAT 

CTAGCTTGAG AGATATGCCC TAAGACAGCG ATTCTCGATA TTTCTAAATT AAAGAATTGT 

GTGATGGTGG C T C AC AT ATT TTCTAACTGT GATATTTGCC AGGAGAGTAG AATAATGTTA 

TTCTTCATCC CCAGAATTCC TAAGATTTCA CGTCTCATGT CTTTTCCATA AGGTTCAAAC 

TCTGAGACTT GAGTTCTGAG CCTCAGCAGG TCATTCTGAA TCCCCACTCT CCCCGAGCTG 

GGTCCCTATG GGGGAACTAA CTTCATTGCT TTCTTTTAAA ACATGACGAG TTACCAACAG 

CTCCTCGCTA TTATAAACAT GTTCCTAAGC ATGTCTGTGC ATGCAATAAG CCTTCACTCT 

ACAAGACAGT TATGGTGTAT CGCTTGACAA AACTGAGCAG CCAAGCTGAG TATGAAATAA 

TAATCTAGAC TTGGGAGGCA GACCCAGCAC CTACTGTGAT ATTGCACTTC GCCTTTGGGG 
GACTCTATGA TTCAAAAGTT CACCATGTAA CACTGACACA TTATTGCTTT CTATTTAG ^AT 36 93 



3093 

3153 

3213 

3273 

3333 

3393 

3453 

3513 

3573 

3633 



TCC AGA GTC AGT GTC AGA AAA CCA TTT ATG TTA CCC CCT GTT GCA GCC 
Ser Arg Val Ser Val Arg Lys Pro Phe Met Leu Pro Pro Val Ala Ala 
150 155 160 

AGT TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGTAAGTACA CATATCTGAT 
Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn 
170 175 

TTACTGCATG CATGGCTCCA AGTATCCTCT ATAGGAGTGT TGCATGGACT TAAAGTTTAT 



3741 



3794 



3854 



-202- 

AAATCACTAC TAATAATGCT GTTCTGTCAC TGTTATTCCT TGTATGGGCT TCCTGACAAT 3 914 

TAAATATCTG GTTTGTAGAA TCGGATCTCC TTAGAGGTTA AGATGACCAT GACAAAATTA 3 9 74 

GGCC7AATCAA CTTTCTGCGA AGGTTATTTT AAATAAGGCA CGAAATTAAT TG7AAGGAAAA 4 0 34 

AAAAATAC7AA GCAAGGCCTT ATTTTGAATC ATGGTAGGCT TAAAATAGAC TTTGTGGAGA 4 0 94 

ATGTCCCTGA TCAAAGTGGA GTTTTCAGAT TTCAAGTGCA TGTGCTAACT CTCCACAATG 4154 

TCAAGGCTAT TTTCAGTTTT GTGTCTCCAT ATTTACTACT GCATGTTTGG AAATTTGCTG 4 214 

ATGCTGTTAG ATTACCTAAG AATGTATGTT GAAGAAGAAT GGACTTCTTT CCCTAAAATT 42 74 

TCTGTCCTCT TTGCCCAAGA ACCCACGTTC CTGGAAGACT ATCTTATTTT CATGTCTGTG 4 3 34 

CAATGATCAT TATAAAGATT ATTGAATATA CTGGGAATAC TCTGGTTTCT GTTTTTACAG 43 94 

ATTCAT7AATA GCTTATTCAG TCTTTAAAGA AAGTTCTCTG 7AAGTCCATGC TTTAGAATGT 44 54 

TTCTCTATCA AAACTTGACC TGGACCTTAA ATAAAGCTAT ATTTAGTCTT TTTATCCCTG 4 514 

AA7AAATATAT TTCACAGTGT AGACATTTGA TATACATCTA AGGGAAGGAT GCTGCCAG7AA 4 5 74 

TGCTCGGGCT GGCAGTCTAC AAAGTCCACT GCTCTCAGGA TGGACTTCTG AAAGCGGAAA 46 34 
TTGTGAACTG C ATG CAT AT A ACATATCAGA TCCTCGAGC 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 196 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Thr Trp lie lie Thr Cys He Tyr Leu Gin Leu Leu Leu Phe Asn Pro 
-20 "15 -10 " 5 

Leu Val Lys Thr Gin Glu He Cys Arg Asn Pro Val Thr Asp Asn Val 

15 10 

Lys Asp He Thr Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met He 
15 20 25 

Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu Pro Ser His Cys Trp 
30 35 40 

Leu Arg Asp Met Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu 
45 50 55 60 



4673 



-203- 

Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie He 
65 70 75 

Asp Lys Leu Gly Lys He Val Asp Asp Leu Val Ala Cys Met Glu Glu 
80 8 5 90 

Asn Ala Pro Lys Asn Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg 
95 100 105 

Asn Phe Thr Pro Glu Glu Phe Phe Ser He Phe Asn Arg Ser He Asp 
110 I" 120 

Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu 
125 

Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg Val Ser Val Arg Lys 
145 150 I 55 

Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser 



160 



165 



170 



Ser Ser Ser Asn 
175 

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 849 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 26 . . 844 



(ix) FEATURE: 

(A) NAME /KEY : mat__peptide 

(B) LOCATION: 101.-844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

CTGGATCGCA GCGCTGCCTT TCCTT ATG AAG AAG ACA CAA ACT TGG ATT ATC 

Met Lys Lys Thr Gin Thr Trp He He 
-25 -20 

ACT TGC ATT TAT CTT CAA CTG CTC CTA TTT AAT CCT CTC GTC AAA ACT 
Thr Cys He Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys Thr 
-15 -10 " 5 

CAG GAG ATC TGC AGG AAT CCT GTG ACT GAT AAT GTA AAA GAC ATT ACA 
Gin Glu He Cys Arg Asn Pro Val Thr Asp Asn Val Lys Asp He Thr 
! 5 1° 15 



52 



100 



148 



-204- 



AAA CTG GTG GCG AAT CTT CCA AAT GAC TAT ATG ATA ACC CTC AAC TAT 
Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr 



20 



25 



GTC GCC GGG ATG GAT GTT TTG CCT AGT CAT TGT TGG TTA CGA GAT ATG 
Val Ala Gly Met Asp Val Leu Pro Ser His Cys Trp Leu Arg Asp Met 
35 40 45 

GTA ACA CAC TTA TCA GTC AGC TTG ACT ACT CTT CTG GAC AAG TTT TCA 
Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu Asp Lys Phe Ser 
50 55 60 

AAT ATT TCT GAA GGC TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT GGG 
Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser He He Asp Lys Leu Gly 
65 70 " 80 

AAA ATA GTG GAT GAC CTC GTG GCA TGT ATG GAA GAA AAT GCA CCT AAG 
Lys He Val Asp Asp Leu Val Ala Cys Met Glu Glu Asn Ala Pro Lys 
85 90 95 

AAT GTA AAA GAA TCA CTG AAG AAG CCA GAA ACT AGA AAC TTT ACT CCT 
Asn Val Lys Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro 
100 105 HO 

GAA GAA TTC TTT AGT ATT TTC AAT AGA TCC ATT GAT GCC TTC AAG GAC 
Glu Glu Phe Phe Ser He Phe Asn Arg Ser He Asp Ala Phe Lys Asp 
115 120 125 

TTC ATG GTG GCA TCT GAC ACT AGT GAT TGT GTG CTC TCT TCA ACA TTA 
Phe Met Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Ser Thr Leu 
130 135 140 

GGT CCT GAG AAA GAT TCC AGA GTC AGT GTC ACA AAA CCA TTT ATG TTA 
Gly Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu 
145 150 155 I 60 

CCC CCT GTT GCA GCC AGT TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT 
Pro Pro val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn 
165 170 175 

AGG AAA GCC GCA AAG TCC CCT GAA GAC CCA GGC CTA CAA TGG ACA GCA 
Arg Lys Ala Ala Lys Ser Pro Glu Asp Pro Gly Leu Gin Trp Thr Ala 
180 185 19° 

ATG GCA CTG CCG GCT CTC ATT TCG CTT GTA ATT GGC TTT GCT TTT GGA 
Met Ala Leu Pro Ala Leu He Ser Leu Val He Gly Phe Ala Phe Gly 
195 200 205 

GCC TTA TAC TGG AAG AAG AAA CAG TCA AGT CTT ACA AGG GCA GTT GAA 
Ala Leu Tyr Trp Lys Lys Lys Gin Ser Ser Leu Thr Arg Ala Val Glu 
210 215 



0 



AAT ATA CAG ATT AAT GAA GAG GAT AAT GAG ATA AGT ATG TTG CAA CAG 
Asn He Gin He Asn Glu Glu Asp Asn Glu lie Ser Met Leu Gin Gin 
225 230 235 240 



196 



244 



292 



340 



388 



436 



484 



532 



580 



628 



676 



724 



772 



820 



-205- 

AAA GAG AGA GAG TTT CAA GAG GTG TAATT 
Lys Glu Arg Glu Phe Gin Glu Val 
245 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Lys Lys Thr Gin Thr Trp He He Thr Cys He Tyr Leu Gin Leu 
-25 -20 "15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Gin Glu He Cys Arg Asn Pro 
-5 1 5 

Val Thr Asp Asn Val Lys Asp He Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Asn Asp Tyr Met He Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp Leu Arg Asp Met Val Thr His Leu Ser Val Ser 
40 45 50 55 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser He He Asp Lys Leu Gly Lys He Val Asp Asp Leu Val 
75 80 85 

Ala Cys Met Glu Glu Asn Ala Pro Lys Asn Val Lys Glu Ser Leu Lys 
90 95 100 

Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe Ser He Phe 
105 HO 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
120 125 130 135 

Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ser Pro 
170 175 180 



84 9 
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Glu Asp Pro Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu He 
185 190 195 

Ser Leu Val He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
200 205 210 215 

Gin Ser Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
220 225 230 

Asp Asn Glu He Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
235 240 245 

Val 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

(ix) FEATURE : 

(A) NAME /KEY : CDS 

(B) LOCATION: join ( 905 .. 1018 f 1914.. 1976, 2572.. 2742, 3152 

. .3307, 3513 . .3595) 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: join (965 1018 , 1914.. 1976, 2572.-2742, 3152 

. .3307 , 3513 . .3595) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CACAAGTGAG TAGGGCGCGC CCGGGAGCTC CCAGGCTCTC CAGGAAAAAT CGCGCCCGGT 
GCCCCGGGGA AGCCGGCGCT CCCTGGGACT TGCAGCTGGG GCGTGCAGGG CTGTGCCTGC 
CGGGTGAGAT ACTACAAAGA TAAATCAGTT GCACAAGTTC TTGAAACTCT ACAGTGTAAT 
AAGGAAAAAT AAGTCATGCA TAAAAGCAAC TATAATACAT AATAGAAAAT GTTATTTTCA 
AGCCGATGTG TAGGTTATGT GTGTTCGAGA GAGAGAGAGA GAAGACAGAT TACTTTCTGC 
TAGGGTTCAA GAATGCCTTC CTGTTGGCTA AGGAAATATT TTCCTTAAGT GGCTAAAAAG 36 0 

CTGTGTTTCA AAATATTCTT TTGATGTCTC ACAAATTCAG TGGAATTCTC TTAGGTCTAA 42 0 

AAATATACAT CTCTCTCACT TTAACTTGGT GTGCTATTGT AGATTATTGG ATTAAAGCAC 48 0 

TGCTCAGGGA TTATGCTGCT TCTTGCCAAG CAGTCTACAT TTAAAGTAGA AATAAGATGT 
TTCTTTTGGT GCCATAAGGT ATACATTTTA TGCATTCTCT AGTTTTTAGA AGATACCCTA 



60 
120 
180 
240 

300 



540 
600 



-207- 

AGGGCTAAGT CTTTAACATG CTGCTACAAG TTTATTCCTA ATTGCCATTG GGAAATTGGC 

TGAAGAAAGT TTTTAACAAA AGTTAACAAT ATTGTCATTG AGAGAATAAT TCAAAATGGA 

TTTTAACTAA AAGCTTTTAA AAACTTTGGT GAGCATAGCT TGAATGCGTA ATATTTAATT 

GCATTTAAGC CAATAACATA TATTAGACTG GTCTTTTTGT GCATCAAGGC ATTAGATGTT 

AAAAGTTTGA ATGATTACAG ATCTTAACTG ATGATCACCA AGCAATTTTT CTGTTTTCAT 

TTAG ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG CTG CTC CTA TTT AAT 
Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu Leu Leu Phe Asn 
-20 -15 

rrT rTC CTC AAA ACT GAA GGG ATC TGC AGG AAT CGT GTG ACT AAT AAT 
Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg Val Thr Asn Asn 
_ 5 1 5 10 

GTA AAA GAC GTC ACT AAA TTG GTAAGTAAGG AATGCTTTAC CGTGCTGTGT 
Val Lys Asp Val Thr Lys Leu 
15 

AAAAAAGAGC TGTGGCTCTT TTTCCTGTGC TTGTTGATAA AAGATTTAGA TTTTTCTTGC 

CCCAAAGTAA TGTTTTCCTA AAGTGGGGAA AGTAATCACT GGGTTACAAT AAAGGGTTTA 

TAGAAAGCAG GTAGTGAGAT ATTTAGGGTC ATGGATAATT TGTTGGTAAA ACTGGCTAGT 

TGCACACCAC TGCTGTGACT GCTTCTTTGC TGGTCTTCTC CCCATCCTTC ATAGGCAGTG 

AAGGACCTTG GAGAGTTCGC TGTGTGCTGA TGGGCTTGCC CCAGCTTGTT CCCCATAATC 

TCTCCAGTGG GTTTCC CAGC ATGTTCTATT CCCCTTCACA TGTCTTCCTA CTCTTCTTTA 

AAAAGCCTAA CGAAAGGAAA TCTGAAATGG CTATTCTCCC AATTCAATCA GCAGGAAGAC 

CCTGTCACAT GTCAGTGGGT GTTTGCTCCT TCAGGGAACA TAGAGAGGTG ATTCATTGCC 

CACATGTTGA AGGGACTCAT CTCCCTGGTT TGTCACATTG AACTCTTCCC TCAGCGAAAG 

CATTTGCATT GCTTCCCGAA TTCCAAGATC ACAGGTGGAA GCTGAAATTC AGATCATGTT 

TCCAAAACTC AGTAGGTTAT ACCTAGCCAG GCATAACTGA ATTTGGAGTC TAAAAGATCT 

GTATTATCAC TTTTTTATTT TGAAGGATGC CTTTTGATTA CAGAGGGAAA TCAAGGATTA 

AAAATCAATA TACATGTAAA TATTGAAATT CATTGGTAAC TTTAAAAAGC ACAACAGTTT 

TGTGTGCTTT TCTCCAAAGC ACTACAAATA TGATTAATTG ATGTATAAGA ATTTTCTTAT 

GGAATTTTTT TTTTTGTCTC TGTAG GTG GCA AAT CTT CCA AAA GAC TAC ATG 

Val Ala Asn Leu Pro Lys Asp Tyr Met 
20 25 



660 
72 0 
780 
840 
900 
949 

997 

1048 

1108 

1168 

1228 

1288 

1348 

1408 

1468 

1528 

1588 

1648 

1708 

1768 

1828 

1888 

1940 



-20X- 



ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT TTG GTATGTAAAC 
He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 

30 35 
TACATTTCTG AGTTTCATTT TAGTAGCTCA TAGAAGAAAT GGGATCATTC ATATTGAGAT 
AGTACACTAG CTGCTATTTA GGAGCTTGCT TATTGTCAGG ATTTGAAGAA TTTATCTTTG 
GAATTTGACT TGCAGGCTTT TTTTTCCCCC TCTTCCTGTT ACAAGAGTCC CTCCTCCTAT 
TACAATAGTC CCTCCTCCTC CTGTCACACT AGTCCCTTCT CTTCCTGTTA CAATAACCCC 
TGTCCTCCTA TTACAACATT TTAAGTAATG T AAT AT T AAT TTTAAAAATC TGGCCAGGCA 

„_™_ _™_™ £ T PCCAGCACAT TGGGAAGCTG AGACGGGTGG ATCATTTGAG 
LWl^i i^ft xvj^j.^^^. 

GTCAGGAAGT TTGAGACAGC CTGGCCAACA TGGTGAAACT TCCTCTCTAC TAAAAATAAA 
AAAGTAGCCA GGCATGGTGG CAGGCACTTG TAATCTGAGC TACTCGAGAG GCTGAGGCAG 
GAGAATCACT TGAGTAACTA AAACGATAGC TTTGAAGAGT ACTCCGAGTT TTATGGCACT 
TACTTATTAA AATAGCTGTT TTGTCTCTTT TTTCATATCT TGCAG CCA AGT CAT 

40 

TGT TGG ATA AGC GAG ATG GTA GTA CAA TTG TCA GAC AGC TTG ACT GAT 
Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser Leu Thr Asp 
45 50 55 

CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC TTG AGT AAT TAT TCC 
Zl Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser Asn Tyr Ser 
60 65 70 

ATC ATA GAC AAA CTT GTG AAT ATA GTG GAT GAC CTT GTG GAG TGC GTG 
lie lie Asp Lys Leu Val Asn He Val Asp Asp Leu Val Glu Cys Val 

75 80 85 

AAA GAA AAC TCA TCT AAG GTAACTTTGT GTTCATTGGG ATTATTTTTC 

Lys Glu Asn Ser Ser Lys 
95 

ATTACGCTTC TCTAAAAACC CATGCTTCTT GGTGCTGTTG GGGAAAATGA GGCACCTTTA 
TTTATGATAT TTTGATTGTA TAAACTTCAA ATTTAAAAAT CTTGTTCAGA TGAGCAAAGA 
AAACAAGTAT TTGCAGTTAT ACTGCAATAC TGAAGTGCAC ATTCTTGTGT TCACTGCCCC 
AGATTCAACT TGTGATCCCA CTGGGATCAC TACCCTGCAT TACCAATCTG AATTACATAC 
GTTAAAACAG CCATCTAAAA GTGCTAGTTG T AAGAGT CT A AATACTTGAA TCTTTGAGAG 
ACATATTTAT AGTCCATTAT CTTCACCTCA GTTAAGTCTG AAGACTATTT GAAAAATGTA 



1986 

2046 

2106 

2166 

2226 

2286 

2346 

2406 

2466 

2526 

2580 

2628 

2676 

2724 

2772 

2832 
2892 
2952 
3012 
3072 
3132 



-209- 

RT _ TATT „ T ^TTCTAG GAT CTA AAA AAA TCA TTC AAG AGC CCA GAA CCC 
* * ~ Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu Pro 

100 

AGO CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT TTT AAT AGA TCC ATT 
Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe Asn Arg Ser He 
110 US 120 

GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT GTG 
Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys Val 
125 130 135 

GTT TCT TCA ACA TTA AGT CCT GAG AAA GGTAAGACAT GTAAGCATTT 
Val Ser Ser Thr Leu Ser Pro Glu Lys 
140 I 45 

C C AGT T C AAA TGTAAACAAC AAACTTAAAT CTTCCCTATG TAGTAAGAAT CTACCTCTGT 

GTTAAGCTGT AGCAAGATAC ATGCATGTAC GT CT AAT AAA AAAGCAGATA TCAATAGCAC 

AGAAGAAACT CTATAACTCA TACAAATCAC CATATAACAC TGACACATTA TTGCTTTCTA 

TTT AG ATT CCA GAG TCA GTG TCA CAA AAC CAT TTA TGT TAG CCC CTG 
He Pro Glu Ser Val Ser Gin Asn His Leu Cys Tyr Pro Leu 
150 155 160 

TTG CAG CCA GCT CCC TTA GGA ATG ACA GCA GTA GCA GTA ATAGTAAGTA 
Leu Gin Pro Ala Pro Leu Gly Met Thr Ala Val Ala Val 
165 170 175 

CATATATCTG ATTTAATGCA TGCATGGCTC CAATTAGCAC CTATAGGAGT ATTGCATGGG 
CTTTCAAGGA AACTTCTACA TTTATTATTA TTGATACTGT TCTGTTACTG TTATTCCTTT 
TATGGTCTTC TTGAGACTTA AGTTTGTAGA ATTAAATTTC CCTAGAGCTG GAGATAATGT 
TTAGAGAATT AGGCCAATAA ATTT 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Thr Trp He Leu Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro 
-20 

Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg Val Thr Asn Asn Val 
1 5 1° 



3184 

3 2 32 

3280 

3327 

3387 
3447 
3507 
3554 

3603 

3663 
3723 
3783 
3807 



Lys Asp Val Thr 
15 

Thr Leu Lys Tyr 

30 

lie Ser Glu Met 
45 

Asp Lys Phe Ser 



Asp Lys Leu Val 
80 

Asn Ser Ser Lys 
95 

Leu Phe Thr Pro 
110 

Ala Phe Lys Asp 
125 

Ser Ser Thr Leu 



His Leu Cys Tyr 
160 




Lys Leu Val Ala 

20 

Val Pro Gly Met 
35 

Val Val Gin Leu 
50 

Asn lie Ser Glu 
65 

Asn lie Val Asp 



Asp Leu Lys Lys 
100 

Glu Glu Phe Phe 
115 

Phe Val Val Ala 
130 

Ser Pro Glu Lys 
145 

Pro Leu Leu Gin 



-210- 

Asn Leu Pro Lys 



Asp Val Leu Pro 
40 

Ser Asp Ser Leu 
55 

Gly Leu Ser Asn 
70 

Asp Leu Val Glu 
85 

Ser Phe Lys Ser 



Arg lie Phe Asn 
120 

Ser Glu Thr Ser 
135 

lie Pro Glu Ser 
150 

Pro Ala Pro Leu 
165 




Asp Tyr Met lie 

25 

Ser His Cys Trp 



Thr Asp Leu Leu 
60 

Tyr Ser lie lie 
75 

Cys Val Lys Glu 
90 

Pro Glu Pro Arg 
105 

Arg Ser lie Asp 



Asp Cys Val Val 
140 

Val Ser Gin Asn 
155 

Gly Met Thr Ala 
170 



Val Ala Val 
175 



(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 17.. 640 



(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 92. .640 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 : 



-211- 

AAGCTTGCCT TTCCTT ATG AAG AAG ACA CAA ACT TGG ATT CTC ACT TGC 4 9 

Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys 
_25 -20 ~ 15 

ATT TAT CTT CAG CTG CTC CTA TTT AAT CCT CTC GTC AAA ACT GAA GGG 9 7 

He Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly 
-10 "5 1 

ATC TGC AGG AAT CGT GTG ACT AAT AAT GTA AAA GAC GTC ACT AAA TTG 14 5 

He Cys Arg Asn Arg Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu 
5 10 15 

GTG GCA AAT CTT CCA AAA GAC TAC ATG ATA ACC CTC AAA TAT GTC CCC 193 
Val Ala Asn Leu Pro Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro 
20 25 30 

GGG ATG GAT GTT TTG CCA AGT CAT TGT TGG ATA AGC GAG ATG GTA GTA 241 
Gly Met Asp Val Leu Pro Ser His Cys Trp He Ser Glu Met Val Val 
35 40 45 50 

CAA TTG TCA GAC AGC TTG ACT GAT CTT CTG GAC AAG TTT TCA AAT ATT 289 
Gin Leu Ser Asp Ser Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He 
55 60 65 

TCT GAA GGC TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT GTG AAT ATA 337 
Ser Glu Gly Leu Ser Asn Tyr Ser He He Asp Lys Leu Val Asn He 
70 75 80 

GTG GAT GAC CTT GTG GAG TGC GTG AAA GAA AAC TCA TCT AAG GAT CTA 385 
Val Asp Asp Leu Val Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu 
85 90 95 

AAA AAA TCA TTC AAG AGC CCA GAA CCC AGG CTC TTT ACT CCT GAA GAA 433 
Lys Lys Ser Phe Lys Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu 
100 105 HO 

TTC TTT AGA ATT TTT AAT AGA TCC ATT GAT GCC TTC AAG GAC TTT GTA 4 81 

Phe Phe Arg He Phe Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val 
115 120 125 130 

GTG GCA TCT GAA ACT AGT GAT TGT GTG GTT TCT TCA ACA TTA AGT CCT 52 9 

Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro 
135 140 145 

GAG AAA GAT TCC AGA GTC AGT GTC ACA AAA CCA TTT ATG TTA CCC CCT 577 
Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro 
150 155 160 

GTT GCA GCC AGC TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGT AAG 625 
Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser Lys 
165 170 175 



TAC ATA TAT CTG ATT TAATGCATGC ATGGCTCCAA TTAGCACCTA TAGGAGTATT 
Tyr He Tyr Leu He 
180 



S80 



p_ 

GCATGGGCTT TCAAGGAAAC TTCTACATTT ATTATTATTG ATACTGTTCT GTTACTGTTA 
TTCCTTTTAT GGTCTTCTTG AGACTTAAGT TTGTAGAATT AAATTTCCCT AGAGCTGGAG 
ATAATGTTTA GAGAATTAGG 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys He Tyr Leu Gin Leu 
_ 25 -20 "15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg 
-5 1 5 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 



60 



65 70 



Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
105 HO 115 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser Lys Tyr He Tyr Leu He 
170 175 180 



740 



300 



820 
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(2) INFORMATION FOR SEQ ID NO : 4 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5864 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: j oin ( 56 5 . . 57 9 , 1684.. 1797, 2693.. 2755, 3351.. 3521, 

3932. .4088, 4314. .4397, 4778. .4887, 5208. .5275, 
5677 . . 5713) 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: j oin ( 1 74 4 . . 17 9 7 , 2693.-2755, 3351.. 3521, 3932 

..4088, 4314. .4397, 4778. .4887, 5208. .5275, 5677 
. .5713) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

GAGCTCCGAG CCCTCTCTGG CGCGCGAGGT ATTTCGTCTG TNCCCGGGGG TGCCAGGTGA 

GCCCCAGCGG ATCCGGGAGG GTAAGCTGGG ACTCCTCGCG AG C AG TAG C T GCAGGGTACC 

AAGCTTCGCC CTCTGCGTCC CCGCGCCTTC GCGGTCTCCC GCCAGTGCAG GTCCGGGGCC 

CCCAGGCGAG CGGACAAGGT TGGCCTAATC TGCCAAACTT CTGGGGCATT TACCGTGCTC 

TGGCCGCCCT CCCGATTCTT CCCTCCGCGC CCTTGCCTGC TTCTCGCCTA CCCCGGGCTC 

CGGAAGGGAA GGAGGCGTGT CCGGAGCAGG CGGGCGGGAA CTGTATAAAA GCGCCGGCGG 

CTCAGCAGCC GGCTTCGCTC GCCGCCTCGC GCCGAGACTA GAAGCGCTGC GGGAAGCAGG 

GACAGTGGAG AGGGCGCTGC GCTCGGGCTA CCCAATGCGT GGACTATCTG CCGCCGCTGT 

TCGTGCAATA TGCTGGAGCT CCAGAACAGC TAAACGGAGT CGCCACACCA CTGTTTGTGC 

TGGATCGCAC CGCTGCCTTT CCTT ATG AAG AAG ACA CAA GTGAGTAGGG 

Met Lys Lys Thr Gin 

-25 

CGCGCCCGGG AGCTCCCAGG CTCTCCAGGA AAAATCGCGC CCGGTGCCCC GGGGAAGCCG 
GCGCTCCCTG GGACTTGCAG CTGGGGCGTG CAGGGCTGTG CCTGCCGGGT GAGACAAGAG 
GATGCGGGGG AGGCCGGCGT GGTGTGTGAT CCCGAGCCGA GCCGNNTGAG CCAGGGAGAA 
AAGGAGTGGG AGTACTGAGA GGGAGCCAGT GTCAAGTTTG GAGCCTCAGC AGTTAAGTTT 
TGAGCTGTCA GTCGGAAACC GTAATTCCCG TCTGGTGGAA AGATTGGCTT TTNGNCCACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 

589 

649 
709 
769 
829 
889 
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GAATGTAAGT TATCACAGAT AC TAC AAAG A TAAATCAGTT GCACAAGTTC TTGAAACTCT 

ACAGTGTAAT AAGGAAAAAT AAGTCATGCA TAAAAGCAAC TATAATACAT AATAGAAAAT 

GTTATTTTCA AGCCGATGTG TAGGTTATGT GTGTTCGAGA GAGAGAGAGA GAAGACAGAT 

TACTTTCTGC TAGGGTTCAA GAATGCCTTC CTGTTGGCTA AGGAAATATT TTCCTTAAGT 

GGCTAAAAAG CTGTGTTTCA AAATATTCTT TTGATGTCTC ACAAATTCAG TGGAATTCTC 

TTAGGTCTAA AAATATACAT CTCTCTCACT TTAACTTGGT GTGCTATTGT AGATTATTGG 

ATTAAAGCAC TGCTCAGGGA TTATGCTGCT TCTTGCCAAG CAGTCTACAT TTAAAGTAGA 

AATAAGATGT TTCTTTTGGT GCCATAAGGT AT AC AT TT T A TGCATTCTCT AGTTTTTAGA 

AGATACCCTA AGGGCTAAGT CTTTAACATG CTGCTACAAG TTTATTCCTA ATTGCCATTG 

GGAAATTGGC TGAAGAAAGT TTTTAACAAA AGTTAACAAT ATTGTCATTG AGAGAATAAT 

TCAAAATGGA TTTTAACTAA AAGCTTTTAA AAACTTTGGT GAGCATAGCT TGAATGCGTA 

ATATTTAATT GCATTTAAGC CAATAACATA TATTAGACTG GTCTTTTTGT GCATCAAGGC 

ATTAGATGTT AAAAGTTTGA ATGATTACAG ATCTTAACTG ATGATCACCA AGCAATTTTT 

CTGTTTTCAT TTAG ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG CTG CTC 
Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu Leu 
-20 

TTT AAT CCT CTC GTC AAA ACT GAA GGG ATC TGC AGG AAT CGT GTG 
Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg Val 
-5 1 5 

ACT AAT AAT GTA AAA GAC GTC ACT AAA TTG GTAAGTAAGG AATGCTTTAC 
Thr Asn Asn Val Lys Asp Val Thr Lys Leu 
10 15 

CGTGCTGTGT AAAAAAGAGC TGTGGCTCTT TTTCCTGTGC TTGTTGATAA AAGATTTAGA 
TTTTTCTTGC C C C AAAG T AA TGTTTTCCTA AAGTGGGGAA AGTAATCACT GGGTTACAAT 
AAAGGGTTTA TAGAAAGCAG GTAGTGAGAT ATTTAGGGTC ATGGATAATT TGTTGGTAAA 
ACTGGCTAGT TGCACACCAC TGCTGTGACT GCTTCTTTGC TGGTCTTCTC CCCATCCTTC 
ATAGGCAGTG AAGGACCTTG GAGAGTTCGC TGTGTGCTGA TGGGCTTGCC CCAGCTTGTT 
CCCCATAATC TCTCCAGTGG GTTTCCCAGC ATGTTCTATT CCCCTTCACA TGTCTTCCTA 
CTCTTCTTTA AAAAGCCTAA C G AAAGG AAA TCTGAAATGG CTATTCTCCC AATTCAATCA 
GCAGGAAGAC CCTGTCACAT GTCAGTGGGT GTTTGCTCCT TCAGGGAACA TAGAGAGGTG 
ATTCATTGCC CACATGTTGA AGGGACTCAT CTCCCTGGTT TGTCACATTG AACTCTTCCC 



949 
1009 
1069 
1129 
1189 
1249 
1309 
1369 
1429 
1489 
1549 
1609 
1669 
1719 

1767 

1817 

1877 

1937 

1997 

2057 

2117 

2177 

2237 

2297 

2357 
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TCAGCGAAAG 


CATTTGCATT 


GCTTCCCGAA 


TTCCAAGATC 


ACAGGTGGAA 


GCTGAAATTC 


2417 


AGATCATGTT 


TCCAAAACTC 


AGTAGGTTAT 


ACCTAGCCAG 


GCATAACTGA 


ATTTGGAGTC 


2 4 77 


TAAAAGATCT 


GT ATT AT C AC 


TTTTTTATTT 


TGAAGGATGC 


CTTTTGATTA 




2 5 3 7 


TCAAGGATTA 


AAAATCAATA 


TACATGTAAA 


TATTGAAATT 


CATTGGTAAC 


TTTAAAAAGC 


2597 


ACAACAGTTT 


TGTGTGCTTT 


TCTCCAAAGC 


AC T AC AAAT A 


TGATTAATTG 


ATGTATAAGA 


2657 


ATTTTCTTAT 


GGAATTTTTT 


TTTTTGTCTC 


TGTAG GTG 
Val 


GCA AAT CTT 
Ala Asn Leu 


CCA AAA 
Pro Lys 


2710 



20 

a ?r t A C ATG ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT TTG 
Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 



2755 



GTATGTAAAC 


TACATTTCTG 


AGTTTCATTT 


TAGTAGCTCA 


TAGAAGAAAT 


GGGATCATTC 


2815 


ATATTGAGAT 


AGTACACTAG 


CTGCTATTTA 


GGAGCTTGCT 


TATTGTCAGG 


ATTTGAAGAA 


2875 


TTTATCTTTG 


G AAT T T G AC T 


TGCAGGCTTT 


TTTTTCCCCC 


TCTTCCTGTT 


ACAAGAGTCC 


2935 


CTCCTCCTAT 


TACAATAGTC 


CCTCCTCCTC 


CTGTCACACT 


AGTCCCTTCT 


CTTCCTGTTA 


2995 


CAATAACCCC 


TGTCCTCCTA 


TTACAACATT 


TTAAGTAATG 


T AAT ATT AAT 


TTTAAAAATC 


3055 


TGGCCAGGCA 


CGGTGGTTCA 


TGCTTGTAAT 


CCCAGCACAT 


TGGGAAGCTG 


AGACGGGTGG 


3115 


ATCATTTGAG 


GTCAGGAAGT 


TTGAGACAGC 


CTGGCCAACA 


TGGTGAAACT 


TCCTCTCTAC 


3175 


TAAAAATAAA 


AAAGTAGCCA 


GGCATGGTGG 


CAGGCACTTG 


T AAT C T GAG C 


TACTCGAGAG 


3235 


GCTGAGGCAG 


GAGAATCACT 


TGAGTAACTA 


AAACGATAGC 


TTTGAAGAGT 


ACTCCGAGTT 


3295 


TTATGGCACT 


TACTTATTAA 


AATAGCTGTT 


TTGTCTCTTT 


TTTCATATCT 


TGCAG CCA 
Pro 


3353 



40 



AGT CAT TGT TGG ATA AGC GAG ATG GTA GTA CAA TTG TCA GAC AGC TTG 
Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser Leu 
45 50 55 

ACT GAT CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC TTG AGT AAT 
Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser Asn 
60 65 70 

TAT TCC ATC ATA GAC AAA CTT GTG AAT ATA GTG GAT GAC CTT GTG GAG 
Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val Glu 
75 80 85 

TGC GTG AAA GAA AAC TCA TCT AAG GTAACTTTGT GTTCATTGGG ATTATTTTTC 
Cys Val Lys Glu Asn Ser Ser Lys 
90 95 



3401 



3449 



3497 



3551 
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ATTACGCTTC TCTAAAAACC CATGCTTCTT GGTGCTGTTG GGGAAAATGA GGCACCTTTA 3611 

TTTATGATAT TTTGATTGTA TAAACTTCAA ATTTAAAAAT CTTGTTCAGA TGAGCAAAGA 36 71 

AAACAAGTAT TTGCAGTTAT ACTGCAATAC TGAAGTGCAC TATTCTTGTG TTCACTGCCC 3 731 

CAGATTCAAC TTGTGATCCC ACTGGGATCA CTACCCTGCA TTACCAATCT GAATTACATA 3 7 91 

CGTTAAAACA GCCATCTAAA AGTGCTAGTT GTAAGAGTCT AAATACTTGA ATCTTTGAGA 3 8 51 

GACATATTTA TAGTCCATTA TCTTCACCTC AGTTAAGTCT GAAGACTATT TGAAAAATGT 3 911 

AATCCTATTT TTTCTTCTAG GAT CTA AAA AAA TCA TTC AAG AGC CCA GAA 3 961 

Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu 
100 105 

CCC AGG CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT TTT AAT AGA TCC 4 00 9 

Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe Asn Arg Ser 
HO 115 12° 

ATT GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT 4 0 57 

He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys 
125 130 135 

GTG GTT TCT TCA ACA TTA AGT CCT GAG AAA G GTAAGACATG TAAGCATTTC 4108 
Val Val Ser Ser Thr Leu Ser Pro Glu Lys 
140 145 



4168 
4228 



CAGTTCAAAT GTAAACAACA AACTTAAATC TTCCCTATGT AG T AAG AAT C TACCTCTGTG 

TTAAGCTGTA GCAAGATACA TGCATGTACG TCTAAAAAAA AG C AG AT AT C AAT AGC AC AG 

AAGAAACTAA TGATTGTAGA TTTGTGGGCT CTATAACTCA TACAAATCAC CATATAACAC 42 8 8 

TGACACATTA TTGCTTTCTA TTT AG AT TCC AGA GTC AGT GTC ACA AAA CCA 433 9 

Asp Ser Arg Val Ser Val Thr Lys Pro 
150 155 

TTT ATG TTA CCC CCT GTT GCA GCC AGC TCC CTT AGG AAT GAC AGC AGT 4 3 87 

Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser 
160 165 170 

AGC AGT AAT A GTAAGTACAT ATATCTGATT TAATGCATGC ATGGCTCCAA 44 3 7 
Ser Ser Asn 
175 

TTAGCACCTA TAGGAGTATT GCATGGGCTT TCAAGGAAAC TTCTACATTT ATTATTATTG 44 9 7 

ATACTGTTCT GTTACTGTTA TTCCTTTTAT GGTCTTCTTG AGACTTAAGT TTGTAGAATT 4 55 7 

AAATTTCCCT AGAGCTGGAG ATAATGTTTA GAGAATTAGG CCAATAAATT TTCTGCTGAG 4617 

GTTATTTTAA ATAAGACATA AAATTAATTT T AG AAAT AT G ATTTATGCCT TTTGTTGAAT 46 77 

CATTAACATA TATACAGAAA C AG T T AAAAC AACCACAGCA TAAGAGAAAA ACTTCTAGAA 4 73 7 
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TGGATATGCT GTATTCATCA GTGTGTTCTT TAAATTATAG GG AAG GCC AAA AAT 

Arg Lys Ala Lys Asn 
180 

CCC CCT GGA GAC TCC AGC CTA CAC TGG CCA GCC ATG GCA TTG CCA GCA 
Pro Pro Gly Asp Ser Ser Leu His Trp Pro Ala Met Ala Leu Pro Ala 
185 190 195 

TTG TTT TCT CTT ATA ATT GGC TTT GCT TTT GGA GCC TTA TAC TGG AAG 
Leu Phe Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys 
200 205 210 



240 

GAG TTT CAA GAA GTG TAATTGTGGC TTGTATCAAC ACTGTTACTT TCGTACATTG 
Glu Phe Gin Glu Val 
245 

GTAAGTTTTT TTCTTCTTTC CTTTTTTTTT CTTTTTTTTA TTATACTTTA AGTTCTAGGG 
TACATGTGCA CAATGTGCAG GTTTGTTACG TATGTTTACA TGTGCCATGT T 



4791 



4839 



4887 



4947 
5007 



GTAAGTGGTA CCATTCCTTT TTTTAAAAAT ATGCTATGTT TACATAAATT ATCATCTTTT 

TTTCCTCAAG AAATGATCCT TTAAGAAAAC AGTGAATCTA CCTTAGCTTA TACTAAACAA 

AATTTAAATT TTATAAAGTT TCCTGTTTCT CATTATGTCT GGAGACAATC CCTCTAGCTG 506 7 

ATAATTCACG CTTAAGAATT AGGAACTAAA ACTGTTATTG GAGTTATTGC CATAAAAGAT 512 7 

AAAAGTGGAG TCCACTTACC TCTTAAATAT TAGACCATTC ATTGATTATT TTACAGTATA 518 7 

TGTCTTTCTT CTTTTTCCAG AAG AGA CAG CCA AGT CTT ACA AGG GCA GTT 

Lys Arg Gin Pro Ser Leu Thr Arg Ala Val 
215 220 

GAA AAT ATA CAA ATT AAT GAA GAG GAT AAT GAG ATA AG GTATTTTGTT 



5237 



5285 



225 




230 




235 






TTGCTAAATG 


TGTGCCCAAT 


CAAGCATGAC 


ATTGCCATTT 


CACACACTGT 


GTACCTGCCC 


5345 


ATAATGTCTT 


TAAGAAGTCC 


TTCACTCATG 


ACAGTAGCTC 


CTAACCAGTG 


AGTCCCAACT 


5405 


CTATCCATGT 


TTCTGATGTC 


TCACTCTCTC 


TTCGTATGTG 


TAT ATG CAT A 


TACAGAGAAA 


5465 


GAAATGTTTT 


AACTACTTGG 


AAAGACTACC 


TTAAGACAAA 


TGAAGTCTTC 


CCTCTTCCCT 


5525 


ATAGTAATAA 


GAAGGTAGGC 


TCCCCCATTC 


AATTTTGCAA 


TCTTCTGCTA 


CTATATTTAC 


5585 


AGAAAAGCTG 


CCTTTTACAA 


TGCCGAGATC 


ATGGTGTACC 


TCAGAATCTC 


TG AC C AAG AG 


5645 


CAAATAAGCA 


TTTTTTCTTA 


TTGTTTTTCA 


G T ATG TTG CAA GAG AAA GAG AGA 
Met Leu Gin Glu Lys Glu Arg 


5698 



5753 

5813 
5864 



(2) INFORMATION FOR SEQ ID NO : 4 8 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE : amino acid 
( D ) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu 
_25 -20 "15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg 
-5 1 ^ 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
90 95 100 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
105 HO H5 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
170 175 180 

Gly Asp Ser Ser Leu His Trp Pro Ala Met Ala Leu Pro Ala Leu Phe 
185 190 195 

Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 
200 205 210 215 



-2 1 9- 



Gln Pro Ser Leu 



Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 



220 



230 



Asp Asn Glu He Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
235 240 245 



Val 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 

Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg 
20 25 30 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 
100 105 HO 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
115 120 125 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 
130 135 140 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
145 150 155 160 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 
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Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
19 5 200 205 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe 
210 215 220 

Ser Leu He He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 

o t n 235 24 0 

225 230 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 



245 



250 



255 



Asp 



Asn Glu He Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 



260 



265 



270 



Val 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Lys Lys Thr Gin Thr Trp He Leu Thr Cys He Tyr Leu Gin Leu 
1 5 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly He Cys Arg Asn Arg 
20 25 30 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 



50 



55 



60 



Pro Ser His Cys Trp He Ser Glu Met Val Val Gin Leu Ser Asp Ser 



65 



70 



75 



80 



Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 



85 



90 



95 



Asn Tyr Ser He He Asp Lys Leu Val Asn He Val Asp Asp Leu Val 



100 



105 



110 



Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 

115 120 125 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg He Phe 

130 135 140 



Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Ala Val Ala Ser Glu Thr 
145 150 155 160 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Thr 
195 200 205 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Phe Phe 
210 215 220 

Ser Leu lie lie Gly Phe Ala Phe Giy Ala Leu Tyr Trp Lys Lys Arg 
225 230 235 240 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
245 250 255 

Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
260 265 270 



Val 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 74 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: 

Met Lys Lys Thr Gin Thr Trp He He Thr Cys He Tyr Leu Gin Leu 
15 10 15 



Leu Leu Phe Asn Pro Leu Val Lys Thr Lys Gly He Cys Gly Lys Arg 
20 25 30 

Val Thr Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 



Lys Asp Tyr Lys He Ala Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp He Ser Val Met Val Glu Gin Leu Ser Val Ser 

65 70 75 80 



Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
85 90 95 



Asn Tyr Ser lie lie Asp Lys Leu Val Lys lie Val Asp Asp Leu Val 
100 105 110 

Glu Cys Thr Glu Gly Tyr Ser Phe Glu Asn Val Lys Lys Ala Pro Lys 
115 120 125 

Ser Pro Glu Leu Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe 
130 135 140 

Asn Arg Ser lie Asp Ala Phe Lys Asp Leu Glu Thr Val Ala Ser Lys 
145 150 155 160 

Ser Ser Glu Cys Val Val Ser Ser Thr Leu Ser Pro Asp Lys Asp Ser 
165 170 175 

Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser 
180 185 190 

Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ser Asn Ser 
195 200 205 

lie Gly Asp Ser Asn Leu Gin Trp Ala Ala Met Ala Leu Pro Ala Phe 
210 215 220 

Phe Ser Leu Val lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys 
225 230 235 240 

Lys Gin Pro Asn Leu Thr Arg Thr Val Glu Asn lie Gin lie Asn Glu 
245 250 255 

Glu Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin 
260 265 270 

Glu Val 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 271 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Lys Thr Gin Thr Trp lie Val Thr Cys lie Tyr Leu Gin Leu Leu 
15 10 15 

Phe Asn Pro Leu Val Lys Thr Lys Gly Leu Cys Arg Asn Arg Val Thr 
20 25 30 



Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro Lys Asp 
35 40 45 



Tyr Lys He Ala Leu Lys Tyr Val Pro Gly Met Asp Val Leu Pro Ser 

50 55 60 

His Cys Trp He Ser Val Met Val Glu Gin Leu Ser Val Ser Leu Thr 
65 70 75 80 

Asp Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser Asn Tyr 
85 90 95 

Ser He He Asp Lys Leu Val Lys He Val Asp Asp Leu Val Glu Cys 
100 105 HO 

Val Glu Gly His Ser Ser Glu Asn Val Lys Lys Ser Ser Lys Ser Pro 
115 120 125 

Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe Asn Arg 
130 135 140 

Ser He Asp Ala Phe Lys Asp Leu Glu Met Val Ala Ser Lys Thr Ser 
145 150 155 160 

Glu Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg Val 
165 170 175 

Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser Leu 
180 185 190 

Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Thr Asn Pro He Glu Asp 
195 200 205 

Ser Ser He Gin Trp Ala Val Met Ala Leu Pro Ala Cys Phe Ser Leu 
210 215 220 

Val He Gly Phe Ala Phe Gly Ala Phe Tyr Trp Lys Lys Lys Gin Pro 
225 230 235 240 

Asn Leu Thr Arg Thr Val Glu Asn He Gin He Asn Glu Glu Asp Asn 
245 250 255 

Glu He Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu Val 
260 265 270 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
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Met Lys Lys Thr Gin Thr Trp He He Thr Cys He Tyr Leu Gin Leu 

1 5 10 15 

Leu Leu Phe Asn Pro Leu Val His Thr Gin Gly He Cys Ser Asn Arg 
20 25 30 

Val Thr Asp Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Lys Asp Tyr Met He Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp He Ser Glu Met Val Glu Gin Leu Ser Val Ser 
65 70 75 80 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Cys He He Asp Lys Leu Val Lys He Val Asp Asp Leu Val 
100 105 HO 

Glu Cys Met Glu His Ser Ser Glu Asn Val Lys Lys Ser Ser Lys Ser 
115 120 125 

Pro Glu Pro Arg Gin Phe Thr Pro Glu Lys Phe Phe Gly He Phe Asn 
130 135 140 

Lys Ser He Asp Ala Phe Lys Asp Leu Glu He Val Ala Ser Lys Met 
145 150 155 160 

Ser Glu Cys Val He Ser Ser Thr Ser Ser Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ser Asn Ser He 
195 200 205 

Glu Asp Ser Ser Leu Gin Trp Ala Ala Val Ala Leu Pro Ala Phe Phe 
210 215 220 

Ser Leu Val He Gly Phe Ala Phe Gly Ala Phe Tyr Trp Lys Lys Lys 
225 230 235 240 

Gin Pro Asn Leu Thr Arg Thr Val Glu Asn Arg Gin He Asn Glu Glu 
245 250 255 

Asp Asn Glu He Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
260 265 270 



Val 



(2) INFORMATION FOR SEQ ID NO : 54 : 
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(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 ammo acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met Lys Lys Thr Gin Thr Trp lie He Thr Cys He Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Gin Glu He Cys Arg Asn Pro 
20 25 30 

Val Thr Asp Asn Val Lys Asp He Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Asn Asp Tyr Met lie Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp Leu Arg Asp Met Val Thr His Leu Ser Val Ser 
65 70 75 80 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser He He Asp Lys Leu Gly Lys He Val Asp Asp Leu Val 
100 105 HO 

Ala Cys Met Glu Glu Asn Ala Pro Leu Asn Val Lys Glu Ser Leu Lys 
115 120 125 

Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe Ser He Phe 
130 135 140 

Asn Arg Ser He Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
145 150 155 160 

Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
165 170 175 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ser Pro 
195 200 205 

Glu Asp Pro Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu He 
210 215 220 



Ser Leu Val He Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
225 230 235 240 
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Gln Ser Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
245 250 255 

Asp Asn Glu lie Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Lys Glu lie Cys Gly Asn Pro 
20 25 30 

Val Thr Asp Asn Val Lys Asp lie Thr Lys Leu Val Ala Asn Leu Pro 
35 40 45 

Asn Asp Tyr Met lie Thr Leu Asn Tyr Val Ala Gly Met Asp Val Leu 
50 55 60 

Pro Ser His Cys Trp Leu Arg Asp Met Val lie Gin Leu Ser Leu Ser 
65 70 75 80 

Leu Thr Thr Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
85 90 95 

Asn Tyr Ser lie lie Asp Lys Leu Gly Lys lie Val Asp Asp Leu Val 
100 105 110 

Leu Cys Met Glu Glu Asn Ala Pro Lys Asn lie Lys Glu Ser Pro Lys 
115 120 125 

Arg Pro Glu Thr Arg Ser Phe Thr Pro Glu Glu Phe Phe Ser lie Phe 
130 135 140 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr 
145 150 155 160 

Ser Asp Cys Val Leu Ser Ser Thr Leu Gly Pro Glu Lys Asp Ser Arg 
165 170 175 



Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
180 185 190 



# 
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Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Ala Lys Ala Pro 

195 200 205 

Glu Asp Ser Gly Leu Gin Trp Thr Ala Met Ala Leu Pro Ala Leu lie 
210 215 220 

Ser Leu Val lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Lys 
225 230 235 240 

Gin Ser Ser Leu Thr Arg Ala Val Glu Asn He Gin He Asn Glu Glu 
245 250 255 

Asp Asn Glu He Ser Met Leu Gin Gin Lys Glu Arg Glu Phe Gin Glu 
260 265 270 

Val 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Thr Trp He lie Thr Cys Phe Cys Leu Gin Leu Leu Leu Leu Asn Pro 
15 10 15 

Leu Val Lys Ala Gin Ser Ser Cys Gly Asn Pro Val Thr Asp Asp Val 
20 25 30 

Asn Asp He Ala Lys Leu Val Gly Asn Leu Pro Asn Asp Tyr Leu He 
35 40 45 

Thr Leu Lys Tyr Val Pro Lys Met Asp Ser Leu Pro Asn His Cys Trp 
50 55 60 

Leu His Leu Met Val Pro Glu Phe Ser Arg Ser Leu His Asn Leu Leu 
65 70 75 80 

Gin Lys Phe Ser Asp He Ser Asp Met Ser Asp Val Leu Ser Asn Tyr 
85 90 95 

Ser He He Asn Asn Leu Thr Arg He He Asn Asp Leu Met Ala Cys 
100 105 110 

Leu Ala Phe Asp Lys Asn Lys Asp Phe He Lys Glu Asn Gly Leu His 
115 120 125 



Tyr Glu Glu Asp Arg Phe He Pro Glu Asn Phe Phe Arg Leu Phe Asn 
130 135 140 
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Ser Thr lie Glu Val Tyr Lys Glu Phe Ala Asp Ser Leu Asp Lys Asn 
145 150 155 160 

Asp Cys lie Met Pro Ser Thr Val Glu Thr Pro Glu Asn Asp Ser Arg 
165 170 175 

Val Ala Val Thr Lys Thr lie Ser Phe Pro Pro Val Ala Ala Ser Ser 
180 185 190 

Leu Arg Asn Asp Ser lie Gly Ser Asn Thr Ser Ser Asn Ser Asn Lys 
195 200 205 

Glu Ala Leu Gly Phe lie Ser Ser Ser Ser Leu Gin Gly lie Ser lie 
210 215 220 

A. la Leu Thr Ser Leu Leu Ser Leu Leu lie Gly Phe lie Leu Gly Ala 
225 230 235 240 

lie Tyr Trp Lys Lys Thr His Pro Lys Ser Arg Pro Glu Ser Asn Glu 
245 250 255 

Thr lie Gin Cys His Gly Cys Gin Glu Glu Asn Glu lie Ser Met Leu 
260 265 270 

Gin Gin Lys Glu Lys Glu His Leu Gin Val 

275 280 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met Lys Lys Thr Gin Thr Trp lie lie Thr Cys lie Tyr Leu Gin Leu 
15 10 15 

Leu Leu Phe Asn Pro Leu Val Lys Thr Gly lie Cys Arg Asn Arg Val 
20 25 30 

Thr Asp Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro Lys Asp 
35 40 45 

Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu Pro Ser 
50 55 60 

His Cys Trp lie Ser Glu Met Val Glu Gin Leu Ser Val Ser Leu Thr 
65 70 75 80 
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Asp Leu Leu Asp Lys Phe Ser Asn 
85 

Ser lie lie Asp Lys Leu Val Lys 
100 

Glu Glu Asn Ser Ser Lys Asn Val 
115 120 

Arg Leu Phe Thr Pro Glu Glu Phe 
130 135 

Asp Ala Phe Lys Asp 
145 

Val Ser Ser Thr Leu Ser Pro Glu 
165 

Lys Pro Phe Met Leu Pro Pro Val 
180 

Ser Ser Ser Ser Asn Arg 
195 

Ala Ala Met Ala Leu Pro Ala Leu 
210 215 

Phe Gly Ala Leu Tyr Trp Lys Lys 
225 230 

Val Glu Asn lie Gin lie Asn Glu 
245 

Gin Glu Lys Glu Arg Glu Phe Gin 
260 

(2) INFORMATION FOR SEQ ID NO: 58: 



lie Ser Glu Gly Leu Ser Asn Tyr 

90 95 

lie Val Asp Asp Leu Val Glu Cys 
105 110 

Lys Lys Ser Lys Ser Pro Glu Pro 
125 

Phe Arg lie Phe Asn Arg Ser lie 
140 



Lys Asp Stir Arg Vdi St^r Veil Thr 

170 175 

Ala Ala Ser Ser Leu Arg Asn Asp 
185 190 



Phe Ser Leu Val lie Gly Phe Ala 

220 

Lys Gin Pro Ser Leu Thr Arg Ala 
235 240 

Glu Asp Asn Glu lie Ser Met Leu 
250 255 

Glu Val 
265 



Phe Met Val Ala Ser Lys Thr Ser Asp Cys Val 
150 155 160 



Lys Ala Asn Glu Asp Ser Ser Leu Gin Trp 
200 205 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 

( ix) FEATURE : 

(A) NAME / KEY : CDS 

(B) LOCATION: j oin ( 1 . . 210 , 223.-258) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 



GAA TTC TTC CGT ATC TTC AAC CGT TCC ATC GAC GCT TTC AAA GAC TTC 
Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe 
15 10 15 



48 
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GTT GTT GCT TCC GAA ACC TCC GAC TGC GTT GTT TGC TCC ACC CTG TCT 9 6 

Val Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser 
20 25 30 

CCG GAA AAA GAC TCC CGT GTT TCG GTT ACC AAA CCG TTC ATG CTG CCG 14 4 

Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro 
35 40 45 

CCG GTT GCT GCT TCC TCC CTG CGT AAC GAC TCC TCC TCC TCC AAC TCC 192 
Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser 
50 55 60 

AAA TAC ATC TAC CTG ATC TAATAGGATC CG GTT ACC AAA CCG TTC ATG 24 0 

Lys Tyr lie Tyr Leu lie Val Thr Lys Pro Phe Met 

65 70 75 

CTG CCG CCG GTT GCT GCT TAATAGGATC C 26 9 

Leu Pro Pro Val Ala Ala 
80 

(2) INFORMATION FOR SEQ ID NO : 5 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 9 : 

Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe 
15 10 15 

Val Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser 
20 25 30 

Pro Glu Lys Asp Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro 
35 40 45 

Pro Val Ala Ala Ser Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Ser 
50 55 60 

Lys Tyr lie Tyr Leu lie Val Thr Lys Pro Phe Met Leu Pro Pro Val 
65 70 75 80 

Ala Ala 

(2) INFORMATION FOR SEQ ID NO : 6 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1404 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(n) MOLECULE TYPE : DNA 

(ix) FEATURE : 

(A) NAME /KEY: CDS 

(B) LOCATION: 184.. 1002 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 2 5 9.. 10 02 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 0 : 

CCGCCTCGCG CCGAGACTAG AAGCGCTGCG GGAAGCAGGG ACAGTGGAGA GGGCGCTGCG 6 0 

CTCGGGCTAC CCAATGCGTG GACTATCTGC CGCCGCTGTT CGTGCAATAT GCTGGAGCTC 12 0 

CAGAACAGCT AAACGGAGTC GCCACACCAC TGTTTGTGCT GG AT C G C AG C GCTGCCTTTC 180 

CTT ATG AAG AAG ACA CAA ACT TGG ATT CTC ACT TGC ATT TAT CTT CAG 228 
Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin 
-25 -20 -15 

CTG CTC CTA TTT AAT CCT CTC GTC AAA ACT GAA GGG ATC TGC AGG AAT 2 76 

Leu Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn 
-10 -5 15 

CGT GTG ACT AAT AAT GTA AAA GAC GTC ACT AAA TTG GTG GCA AAT CTT 3 24 

Arg Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu 
10 15 20 

CCA AAA GAC TAC ATG ATA ACC CTC AAA TAT GTC CCC GGG ATG GAT GTT 3 72 

Pro Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val 
25 30 35 

TTG CCA AGT CAT TGT TGG ATA AGC GAG ATG GTA GTA CAA TTG TCA GAC 42 0 

Leu Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp 
40 45 50 

AGC TTG ACT GAT CTT CTG GAC AAG TTT TCA AAT ATT TCT GAA GGC TTG 468 
Ser Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu 
55 60 65 70 

AGT AAT TAT TCC ATC ATA GAC AAA CTT GTG AAT ATA GTG GAT GAC CTT 516 
Ser Asn Tyr Ser lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu 
75 80 85 

GTG GAG TGC GTG AAA GAA AAC TCA TCT AAG GAT CTA AAA AAA TCA TTC 5 64 

Val Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe 
90 95 100 

AAG AGC CCA GAA CCC AGG CTC TTT ACT CCT GAA GAA TTC TTT AGA ATT 612 
Lys Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie 
105 110 115 
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TTT AAT AGA TCC ATT GAT GCC TTC AAG GAC TTT GTA GTG GCA TCT GAA 
Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Val Val Ala 3er Glu 
120 125 130 

ACT AGT GAT TGT GTG GTT TCT TCA ACA TTA AGT CCT GAG AAA GAT TCC 
Thr Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser 
135 140 145 150 

AGA GTC AGT GTC ACA AAA CCA TTT ATG TTA CCC CCT GTT GCA GCC AGC 
Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser 
155 160 165 

TCC CTT AGG AAT GAC AGC AGT AGC AGT AAT AGG AAG GCC AAA AAT CCC 
Ser Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro 
170 175 180 

CCT GGA GAC TCC AGC CTA CAC TGG GCA GCC ATG GCA TTG CCA GCA TTG 
Pro Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu 
185 190 195 

TTT TCT CTT ATA ATT GGC TTT GCT TTT GGA GCC TTA TAC TGG AAG AAG 
Phe Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys 
200 205 210 

AGA CAG CCA AGT CTT ACA AGG GCA GTT GAA AAT ATA CAA ATT AAT GAA 
Arg Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu 
215 220 225 230 

GAG GAT AAT GAG ATA AGT ATG TTG CAA GAG AAA GAG AGA GAG TTT CAA 
Glu Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin 
235 240 245 

GAA GTG TAATTGTGGC TTGTAT CAAC ACTGTTACTT TCGTACATTG GCTGGTAACA 
Glu Val 



GTTCATGTTT GCTTCATAAA TGAAGCAGCT TTAAACAAAT TCATATTCTG TCTGGAGTGA 

CAGACCACAT CTTTATCTGT TCTTGCTACC CATGACTTTA TATGGATGAT TCAGAAATTG 

GAACAGAATG TTTTACTGTG AAACTGGCAC TGAATTAATC AT C TAT AAA G AAGAACTTGC 

ATGGAGCAGG ACTCTATTTT AAGGACTGCG GGACTTGGGT C T CAT T TAG A ACTTGCAGCT 

GATGTTGGAA GAGAAAGCAC GTGTCTCAGA CTGCATGTAC CATTTGCATG GCTCCAGAAA 

TGTCTAAATG CTGAAAAAAC ACCTAGCTTT ATTCTTCAGA TACAAACTGC AG 
(2) INFORMATION FOR SEQ ID NO : 6 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2 73 amino acids 

(B) TYPE : amino acid 
( D ) TOPOLOGY : 1 inear 



660 

708 

756 

804 

852 

900 

948 

996 

1052 

1112 
1172 
1232 
1292 
1352 
1404 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 1 : 

Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 
-2 5 -20 -15 -10 

Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
-5 1 5 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 

Leu Thr Asp Leu Leu Asp Lys Phe Ser Asn lie Ser Glu Gly Leu Ser 
60 65 70 

Asn Tyr Ser lie lie Asp Lys Leu Val Asn lie Val Asp Asp Leu Val 
75 80 85 

Glu Cys Val Lys Glu Asn Ser Ser Lys Asp Leu Lys Lys Ser Phe Lys 
9 0 9 5 10 0 

Ser Pro Glu Pro Arg Leu Phe Thr Pro Glu Glu Phe Phe Arg lie Phe 
105 110 115 

Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Val Val Ala Ser Glu Thr 
120 125 130 135 

Ser Asp Cys Val Val Ser Ser Thr Leu Ser Pro Glu Lys Asp Ser Arg 
140 145 150 

Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala Ser Ser 
155 160 165 

Leu Arg Asn Asp Ser Ser Ser Ser Asn Arg Lys Ala Lys Asn Pro Pro 
170 175 180 

Gly Asp Ser Ser Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe 
185 190 195 

Ser Leu lie lie Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg 

200 205 210 215 

Gin Pro Ser Leu Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu 
220 225 230 



Asp Asn Glu lie Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu 
235 240 245 
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Val 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 151.. 885 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 226.. 885 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

AGCAGGGACA GTGGAGAGGG CGCTGCGCTC GGGCTACCCA ATGCGTGGAC TATCTGCCGC 6 0 

CGCTGTTCGT GCAATATGCT GGAGCTCCAG AACAGCTAAA CGGAGTCGCC ACACCACTGT 12 0 

TTGTGCTGGA TCGCAGCGCT GCCTTTCCTT ATG AAG AAG ACA CAA ACT TGG ATT 174 

Met Lys Lys Thr Gin Thr Trp lie 
-25 -20 

CTC ACT TGC ATT TAT CTT CAG CTG CTC CTA TTT AAT CCT CTC GTC AAA 2 22 

Leu Thr Cys lie Tyr Leu Gin Leu Leu Leu Phe Asn Pro Leu Val Lys 
-15 -10 -5 

ACT GAA GGG ATC TGC AGG AAT CGT GTG ACT AAT AAT GTA AAA GAC GTC 2 70 

Thr Glu Gly lie Cys Arg Asn Arg Val Thr Asn Asn Val Lys Asp Val 
15 10 15 

ACT AAA TTG GTG GCA AAT CTT CCA AAA GAC TAC ATG ATA ACC CTC AAA 318 
Thr Lys Leu Val Ala Asn Leu Pro Lys Asp Tyr Met lie Thr Leu Lys 
20 25 30 

TAT GTC CCC GGG ATG GAT GTT TTG CCA AGT CAT TGT TGG ATA AGC GAG 366 
Tyr Val Pro Gly Met Asp Val Leu Pro Ser His Cys Trp lie Ser Glu 
35 40 45 

ATG GTA GTA CAA TTG TCA GAC AGC TTG ACT GAT CTT CTG GAC AAG TTT 414 
Met Val Val Gin Leu Ser Asp Ser Leu Thr Asp Leu Leu Asp Lys Phe 
50 55 60 

TCA AAT ATT TCT GAA GGC TTG AGT AAT TAT TCC ATC ATA GAC AAA CTT 4 62 

Ser Asn lie Ser Glu Gly Leu Ser Asn Tyr Ser lie lie Asp Lys Leu 
65 70 75 



GTG AAT ATA GTG GAT GAC CTT GTG GAG TGC GTG AAA GAA AAC TCA TCT 510 
Val Asn lie Val Asp Asp Leu Val Glu Cys Val Lys Glu Asn Ser Ser 
80 85 90 95 

AAG GAT CTA AAA AAA TCA TTC AAG AGC CCA GAA CCC AGG CTC TTT ACT 5 58 

Lys Asp Leu Lys Lys Ser Phe Lys Ser Pro Glu Pro Arg Leu Phe Thr 
100 105 110 

CCT GAA GAA TTC TTT AGA ATT TTT AAT AGA TCC ATT GAT GCC TTC AAG 6 06 

Pro Glu Glu Phe Phe Arg lie Phe Asn Arg Ser lie Asp Ala Phe Lys 
115 120 125 

GAC TTT GTA GTG GCA TCT GAA ACT AGT GAT TGT GTG GTT TCT TCA ACA 6 54 

Asp Phe Val Val Ala Ser Glu Thr Ser Asp Cys Val Val Ser Ser Thr 
130 135 140 

TTA AGT CCT GAG AAA GGG AAG GCC AAA AAT CCC CCT GGA GAC TCC AGC 7 02 

Leu Ser Pro Glu Lys Gly Lys Ala Lys Asn Pro Pro Gly Asp Ser Ser 
145 150 155 

CTA CAC TGG GCA GCC ATG GCA TTG CCA GCA TTG TTT TCT CTT ATA ATT 7 50 

Leu His Trp Ala Ala Met Ala Leu Pro Ala Leu Phe Ser Leu lie lie 
160 165 170 175 

GGC TTT GCT TTT GGA GCC TTA TAC TGG AAG AAG AGA CAG CCA AGT CTT 7 98 

Gly Phe Ala Phe Gly Ala Leu Tyr Trp Lys Lys Arg Gin Pro Ser Leu 
180 185 190 

ACA AGG GCA GTT GAA AAT ATA CAA ATT AAT GAA GAG GAT AAT GAG ATA 84 6 

Thr Arg Ala Val Glu Asn lie Gin lie Asn Glu Glu Asp Asn Glu lie 
195 200 205 

AGT ATG TTG CAA GAG AAA GAG AGA GAG TTT CAA GAA GTG TAATTGTGGC 8 95 

Ser Met Leu Gin Glu Lys Glu Arg Glu Phe Gin Glu Val 
210 215 220 

TTGTATCAAC ACTGTTACTT TCGTACATTG GCTGGTAACA GTTCATGTTT GCTTCATAAA 95 5 

TGAAGCAGCT TTAAACAAAT TCATATTCTG TCTGGAGTGA CAGACCACAT CTTTATCTGT 1015 

TCTTGCTACC CATGACTTTA TATGGATGAT TCAGAAATTG G AAC AG AAT G TTTTACTGTG 10 75 

AAACTGGCAC TGA 10 88 

(2) INFORMATION FOR SEQ ID NO : 6 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 3 : 
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Met Lys Lys Thr Gin Thr Trp lie Leu Thr Cys lie Tyr Leu Gin Leu 

-25 -20 -15 -10 



Leu Leu Phe Asn Pro Leu Val Lys Thr Glu Gly lie Cys Arg Asn Arg 
-5 1 5 

Val Thr Asn Asn Val Lys Asp Val Thr Lys Leu Val Ala Asn Leu Pro 
10 15 20 

Lys Asp Tyr Met lie Thr Leu Lys Tyr Val Pro Gly Met Asp Val Leu 
25 30 35 

Pro Ser His Cys Trp lie Ser Glu Met Val Val Gin Leu Ser Asp Ser 
40 45 50 55 



± nl i-L^?p j_j*z:u. -!_■■ 



60 



70 



Asn Tyr Ser lie 
75 

Glu Cys Val Lys 
90 

Ser Pro Glu Pro 
105 

Asn Arg Ser lie 
120 

Ser Asp Cys Val 



Lys Asn Pro Pro 
155 

Pro Ala Leu Phe 
170 

Trp Lys Lys Arg 
185 

lie Asn Glu Glu 
200 



lie Asp Lys Leu 



Glu Asn Ser Ser 
95 

Arg Leu Phe Thr 
110 

Asp Ala Phe Lys 
125 

Val Ser Ser Thr 
140 

Gly Asp Ser Ser 



Ser Leu lie lie 
175 

Gin Pro Ser Leu 
190 

Asp Asn Glu lie 

205 



Val Asn lie Val 
80 

Lys Asp Leu Lys 



Pro Glu Glu Phe 
115 

Asp Phe Val Val 
130 

Leu Ser Pro Glu 
145 

Leu His Trp Ala 
160 

Gly Phe Ala Phe 



Thr Arg Ala Val 
195 

Ser Met Leu Gin 
210 



Asp Asp Leu Val 
85 

Lys Ser Phe Lys 
100 

Phe Arg lie Phe 



Ala Ser Glu Thr 
135 

Lys Gly Lys Ala 
150 

Ala Met Ala Leu 
165 

Gly Ala Leu Tyr 
180 

Glu Asn He Gin 



Glu Lys Glu Arg 
215 



Glu Phe Gin Glu Val 
220 



(2) INFORMATION FOR SEQ ID NO: 64: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 



# 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Glu Glu lie Cys Arg Asn Pro Val Thr Asp Asn Val Lys Asp lie Thr 
15 10 15 

Lys Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr 
20 25 30 

Val Ala Gly Met Asp Val Leu Pro Ser His Xaa Trp Leu Arg Asp 
35 40 45 

(2) INFORMATION FOR SEQ ID NO : 6 5 : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

lie Thr Thr Leu Asn Tyr Val Ala Gly Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Val Ala Ser Asp Thr Ser Asp Cys Val Leu Ser Xaa Xaa Leu Gly Pro 
15 10 15 

Glu Lys Asp Ser Arg Val Ser Val Xaa Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO : 6 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 7 : 

Asp Val Leu Pro Ser His Cys Trp Leu Arg Asp Met 
15 10 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

( Xx ) SEQUENCE DESCRIPTION: SEQ ID NO : 6 8 : 

Glu Glu Asn Ala Pro Lys Asn Val Glu Ser Leu Lys Lys Pro Thr Arg 
15 10 15 

Asn Phe Thr Pro Glu Glu Phe Phe Ser lie Phe Asp Arg Ser lie Asp 
20 25 30 

Ala 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 9 : 

Glu Ser Leu Lys Lys Pro Glu Thr Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 0 : 



Val Ser Val Xaa Lys 
1 5 



# 
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(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

CO STRANDEDNESS : single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

lie Val Asp Asp Leu Val Ala Ala Met Glu Glu Asn Ala Pro Lys 
1 5 10 15 

(2) TN FORMAT I ON FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Asn Phe Thr Pro Glu Glu Phe Phe Ser lie Phe Xaa Arg 
15 10 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Leu Val Ala Asn Leu Pro Asn Asp Tyr Met lie Thr Leu Asn Tyr Val 
15 10 15 

Ala Gly Asp Asp Val Leu Pro Ser His Cys Trp Leu Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ser lie Asp Ala Phe Lys Asp Phe Met Val Ala Ser Asp Thr Ser Asp 
1 5 10 15 

Cys Val Leu Ser Xaa Xaa Leu Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe 
1 5 10 15 

Phe Ser lie Phe Xaa Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Glu Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe 
1 5 10 15 

Phe Ser lie Phe Asp Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 7 : 

Asn Ala Pro Lys Asn Val Lys Glu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi j SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ser Arg Val Ser Val Xaa Lys Pro Phe Met Leu Pro Pro Val Ala Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe 
15 10 15 

Ser lie Phe Asn Arg Ser lie Asp Ala Phe Lys Asp Phe Met Val Ala 
20 25 30 

Ser Asp 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 0 : 



Ser Leu Lys Lys Pro Glu Thr Arg Asn Phe Thr Pro Glu Glu Phe Phe 
15 10 15 
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Ser He Phe Asn Arg Ser He Asp Ala Phe Lys Asp Phe Met Val Ala 
20 25 30 

Ser Asp Thr Ser Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Leu Arg Asp Met Val Thr His Leu Ser Val Ser Leu Thr Thr Leu Leu 
1 5 10 15 

Asp Lys Phe Ser Asn He Ser Glu Gly Leu Ser Asn Tyr Ser He He 
20 25 30 

Asp Lys Leu Gly Lys He Val Asp 
35 40 

(2) INFORMATION FOR SEQ ID NO:82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Ser Arg Val Ser Val Thr Lys Pro Phe Met Leu Pro Pro Val Ala Ala 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 
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Pro Val Ala Ala 



1 



{2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
CCTGAGAAAG ATTCCAGAGTC 21 
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 5 : 
CTGCAGTTTG TATCTGAAG 19 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 6 : 
CATATAAAGT CATGGGTAG 19 
(2) INFORMATION FOR SEQ ID NO : 8 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 8 7 : 



ACTTGTGTCT TCTTCATAAG GAAAGGC 



27 



(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
TGTACGAAAG TAACAGTGTT G 21 
(2) INFORMATION FOR SEQ ID NO : 8 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
ACTGCTCCTA TTTAATCCTC TC 2 2 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
CACTGACTCT GGAATCTTTC TCA 2 3 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



• 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 91 : 
TCGACCCGGA TCCCC 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 92 : 
TCGAGGGGAT CCGGG 

(2) INFORMATION FOR SEQ ID NO:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TCTTCTTCAT GGCGGCGGCA AGCTT 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 94 : 

Asp Ser Arg Val Ser Val Xaa Lys Pro Phe Phe Met Leu Pro Pro Val 
15 10 15 

Ala Ala 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE : amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Asp Ser Arg Val Ser Val Thr Lys Pro Phe Phe Met Leu Pro Pro Val 
15 10 15 

Ala Ala 



(2) INFORMATION FOR SEQ ID NO: 96: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
CGATTTGATT CTAGAAGGAG GAATAACATA TGGTTAACGC GTTGGAATTC GGTAC 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

( i i ) MOLECULE TYPE : DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
CGAATTCCAA CGCGTTAACC ATATGTTATT CCTCCTTCTA GAATCAAAT 
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 



TATGCAGGA 



9 
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(2) INFORMATION FOR SEQ ID NO : 9 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
GATCTCCTGC A 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100 
TATGGAAGGT ATCTGCA 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101 
GATACCTTCC A 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 




11 



17 



TTTCCTTATG 



10 
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(2) INFORMATION FOR SEQ ID NO : 1 0 3 : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 
GCCGCCGCCA TG 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 
TCTTCTTCAT GGCGGCGGCA AGCTT 




12 



25 



